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Abstract — In this paper a scheduling policy is presented which 
minimizes the average delay of the users. The scheduling scheme 
is investigated both by analysis and simulations carried out in the 
context of Orthogonal Frequency Division Multiplexing (OFDM) 
broadcast channels (BC). First the delay optimality is obtained 
for a static scenario providing solutions for specific subprob- 
lems, then the analysis is carried over to the dynamic scheme. 
Furthermore auxiliary tools are given for proving throughput 
optimality. Finally simulations show the superior performance of 
the presented scheme. 

I. Introduction 

The allocation of limited resources among users is a fun- 
damental problem in the design of next generation wireless 
systems. In general, resource allocation problems can be 
formulated as some kind of optimization problem where the 
objective is to maximize/minimize some system performance 
measure under physical layer as well as Quality of Service 
(QoS) constraints. One of the most important performance 
measures of a communication systems is the total system 
throughput and therefore it is often considered as the objective 
of the optimization problem. In a queueing system with 
random packet arrival, the throughput can be considered as the 
maximal possible offered load without violating the stability 
of queues. A scheduling policy is called throughput-optimal, 
if it can keep the queues stable whenever any other feasible 
scheduling scheme can stabilize the queues. It was shown 
that there exist several queue-length-based scheduling schemes 
which achieve throughput optimality [1], [2], [3], [4]. 

However, since the stability of a queueing system only 
guarantees that the queue lengths do not grow without bounds, 
but by no means indicate how long the queue length will be, 
the next step in the performance optimization is to keep the 
queue lengths as short as possible so that the queueing delay 
is minimized. It was shown in [5], [6] that the Longest-Queue- 
Highest-Possible-Rate (LQHPR) policy which maximizes the 
queue-weighted sum of rates is throughput-optimal and is 
strongly delay-optimal for the multiple-access channel. The 
necessary condition for its delay optimality is the symmetry 
both in the fading channels and in the packet arrival rates. 
However, for the BC the LQHPR is not delay-optimal even 
with symmetry assumptions. Seong et al. introduced in [4] 
another throughput-optimal scheduling called Queue Propor- 
tional Scheduling (QPS) which provides superior delay and 
fairness properties for the BC compared to LQHPR. 

Generally, the aforementioned policies are based on the 
same class of optimization problems: Maximizing the sum of 



rates weighted with different parameters, i.e. queue length, 
delay, etc. The solution of the optimization problem always 
corresponds to some boundary point of the channel capacity 
region. Since the capacity region of OFDM BC is com- 
pletely achieved with Costa Precoding and the optimal power 
allocation and precoding order for the weighted sum rate 
maximization problem can be efficiently solved [7], all results 
can also be easily extended to OFDM BC systems. 

In this paper, we analyze characteristics of all throughput- 
optimal scheduling policies and show that they can be formu- 
lated as weighted sum rate maximization problems differing 
only in the choice of the weight factors. Further, the weight 
factors are independent of the current channel states, hence 
cross-layer optimization problems, which usually involve the 
optimization over system parameters in medium access control 
(MAC) layer and physical layer, can be clearly separated into 
two steps: 1. Finding the optimal weight factors according 
to the MAC layer parameters. 2. Solving the weighted sum 
rate maximization problem. Then we introduce an iterative 
algorithm to calculate the weight factors that are optimal with 
respect to the average delay in OFDM BC channels. Here the 
average delay is defined as the average waiting time of each 
bit in the queues. 

The rest of this paper is organized as follows. Section 
II presents the system model. The throughput optimality is 
discussed in Section III. In Section IV we introduce our delay- 
optimal scheduling policy and evaluate results in Section V. 
Finally, we conclude in Section VI. 

II. System Model 

A. Physical layer 

We assume an OFDM BC with M users, K subcarriers, 
and a short term sum power constraint P 
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where x m ^ is the signal transmitted to user to G A4 = 
{1, M} on subcarrier k S K, = {1, K} with power p m ^ 
and E{.} stands for the expectation operator. Then, the system 
equation for each user on each subcarrier can be written as 
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where y m ,k is the signal received by user m on sub- 
carrier k, n m ^ ~ CAf(0, (J 2 ) is circular symmetric ad- 
ditive white Gaussian noise with variance a 2 . Let h = 
. . . , hi^x, h 2 ,i 7 ■ ■ ■ , ■■■hM,K] T denote the stacked vector 
of channel coefficients. We assume that these channel coeffi- 
cients are related to a standard time-varying multipath model 
where the channel is approximately constant over the OFDM 
symbol. Furthermore, we assume that Costa Precoding is 
performed at the base station having full non-causal knowledge 
of all messages to be transmitted. Let tt G II be an arbitrary 
encoding order from the set of all Ml possible encoding 
orders, such that user 7r(l) is encoded first, followed by user 
7r(2) and so on. Then the rate of user 7r(m) can be expressed 
as 
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with g m _k = |/i m ,fc| 2 /cr 2 being the channel gain of user m 
on subcarrier k and p m ,k being the allocated power. The 
instantaneous capacity region of the OFDM BC under a given 
sum power constraint P is given by 
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where f^( m \ is defined in equation (2) and r denotes the 
vector of rates. Now, the ergodic capacity region C erg (P) is 
defined as the set of achievable rates averaged over the channel 
realizations subjected to the short term sum power constraint 
P: 
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B. Medium access control layer 

Assuming that the transmission is time-slotted, data packets 
arrive randomly at the MAC and a buffer with finite length 
is reserved to store the incoming data for each user m G M. 
Simultaneously the data is read out from the buffers according 
to the system state, i.e., the random fading realization and the 
current queue lengths. Thus, the system can be modeled as a 
queueing system with random processes reflecting the arrival 
and the departure of data packets. 

Denoting the buffer state of the m-th buffer in time slot 
n G N by q m (n) and arranging all buffer states in the vector 
q(n) G the evolution of the queue system can be written 
as 

q(n+l) = [q(n)-r(n)] + +a(n), (5) 

where [2]+ = max{0,a; m }, Vrn G M. and a(n) G R* f 
is a random vector denoting the data arrival process. The 
random vector r (n) G describes the rates asserted to 
the individual users according to a specific scheduling policy. 
Supposing that a scheduling policy is a mapping 



which decides the rate allocation depending on the current 
channel fading state h G C MxK and the MAC layer system 
state w G S. w summarizes the current and past information 
that is acquirable at the base station and relevant for the 
optimization, e.g., the current queue length q, the average 
previous arrival rate a and average previous transmit rate r, 
etc. The rate allocation according to the scheduling policy V 
is denoted as r^hjw). 

Note that the process is reminiscent of random walk on the 
half line (with dependent increments) where we have rigor- 
ously used an uncountable state space formulation. Since the 
random variables a(n) are sampled at a given time interval T 
from M independent random processes they are independent. 
Denoting the mean of the packet arrival rate of user m as A TO 
and the constant packet size as s m , the expected bit arrival 
rate for user m is given by p m = s m X m . On the other hand 
the random vector r (n) depends on the buffer and channel 
state. 

III. System Stability and Throughput-optimal 
Scheduling Policies 

A. Definition of system stability and throughput optimality 

First we investigate the maximum possible offered system 
load without violating stability. There exist several definitions 
of stability. In this paper we use the definition of the strong sta- 
bility, which implies also weak stability and nonevanescence 
of the queueing system. 

Definition 1: The queueing system is strongly stable, if 

limsupE{^(n)} < +00, Vi G M. (6) 
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In the sense of the stability definition, we call the set 
of expected arrival rates p stabilizable by a specific sched- 
uler the throughput region of the scheduling policy V. A 
scheduling policy is throughput-optimal if it stabilizes the 
system whenever any other scheduling policies can stabilize 
the system. If the arrival rate p £ C erg (P) and the fading 
gains can be practically upperbounded by some constants, then 
it is impossible to stabilize the system, even if the policy 
is non-stationary and it has knowledge of the future events 
[7]. Therefore, we can define a throughput-optimal scheduling 
policy as a policy, which keeps the system stable for any arrival 
rate whose expected value p lies in the ergodic capacity region. 
For example, using Lyapunov drift technique the LQHPR 
scheduler can be proven to be throughput optimal. 

B. Characterization of throughput-optimal scheduling policies 

For a given channel state, the rate vector allocated with 
a throughput-optimal policy is always a boundary point of 
the instantaneous capacity region C(h, P). Therefore, any 
throughput-optimal scheduling policy can be formulated as the 
optimization problem 



r' p (h, w) = argmax/x T • r, 
rec(h,p) 
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where p is the normal vector of the boundary surface at the 
allocated rate point. Since the boundary surface of the OFDM 



BC channel is continuous and differentiable, the rate allocation 
r v (h, w) can be uniquely characterized with the normal vector 
on the capacity region. 

Furthermore, for any given fi the power and rate allocation 
problem (7) can be efficiently solved [7]. Hence, we can 
use the vector fi, which is also called weight vector in 
the optimization problem (7), to characterize the scheduling 
decision, instead of using power and rate allocation directly. 

In the following we show some properties of the weight 
vector fi. 

Theorem 1: The weight vector fi which characterizes a 
throughput-optimal scheduling policy is independent of the 
current fading state h. 

Proof: We choose arbitrarily a weight vector fi* corre- 
sponding to a fixed boundary point of the ergodic capacity 
region, hence fi* is independent of the instantaneous channel 
state. Then we denote fj, h the weight vector determined by a 
scheduling policy V. We have 
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The equality holds only if fi h = fi* and the boundary point 
is achieved by the corresponding scheduler, otherwise the 
scheduling policy gives a rate vector in the interior of the 
ergodic capacity region. If the expected rates of the arrival 
process equals the rates on the boundary point, there must 
be some user i who has Eh {rf (h, w)} < pi and its queue 
expands infinitely. ■ 
Following the result in Theorem 1, we can define the weight 
vector H v (w) of a throughput-optimal policy as a function 
only determined by the MAC layer system state w. In this 
way, the classical cross-layer optimization problem can be 
separated into two parts: Finding the optimal weight vector fi 
according to the MAC layer parameters; solving the rate and 
power allocation problem (7) on the physical layer with the 
giving weight vector. Since the second part can be efficiently 
solved on the physical layer, the scheduling design problem 
reduced to find the optimal weight vector for the optimization 
problem. 

IV. Delay-optimal scheduling policy 

So far we characterize the class of throughput-optimal 
scheduling policies. Since the stability definition doesn't re- 
strict the explicit length of the queues, even throughput- 
optimal policies have different delay performance. In the 
following we study the scheduling policy minimizing the 



average bit delay D, which is defined as: 
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It can be regarded as the extension of the common definition 
of the queueing delay in [8]. N is the length of the observation 
time window and en is the average bit arrival rate for the user 
i in the time window. 

A. Delay-optimal scheduling policy for a static channel 

We consider first the delay optimization problem for a static 
channel h and the initial buffer states q(n = 1). We denote 
the previous average arrival rates as a and assume there is no 
packet arriving after n = 0. Further we choose the length of 
observation time window N with qi(N) = 0, Vi E M so that 
the buffers are completely emptied within the time window. 

Thus the delay-optimal scheduling policy can be written as 
the solution of the optimization problem 
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where <?", r™ denote the queue length and transmit rate 
of user i in time slot n. For convenience we also use the 
superscript to denote the time slot in the following. Extending 
the problem (13) in each queue state q™ we have the equivalent 
optimization problem 
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The Lagrangian function is 
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and the delay-optimization problem is transformed into 
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which can be solved easily. 
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Algorithm 1 Idle State Prediction Algorithm 
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(1) Set [i 
r. 

(2) Initialize the length of non-idle state r/f^ 

(3) Set the order 7r so that 

(4) Set t = 0. 
repeat 
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(5.1) Set 77 
for i = 1 to M do 
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(5.2.1) 77* 
repeat 

(5.2.2.1) Increase 77* . 



Solve the maximization 
problem (17) and calculate the evolution of the 
queue state, 
if qf^ > then 

%{%) " ^(i) 

end if . 

""til q[^f < 
end for 

t = t+l 
until - rif _1) < e, Vt £ M 
(3) 77* = i,W 

[77] denotes the smallest integer larger than r). 
e is the predefined error tolerance of r\. 



The 77* in (16) can be obtained with a iterative approach 
given in Algorithm 1. 

Theorem 2: tjW obtained in Algorithm 1 converges to the 
to 77* which gives the optimal /x" for the delay-optimization 
problem (13). 

Proof: In any time slot n > r]*, we have fif = which 
means the buffer of i-th user is empty at the ra-th time slot. In 
any n < 77*, the i-th buffer must be non-empty. Therefore if 
fj(t) — 77*, we have (ft (fa*]) = 0, Mi e M and the algorithm 
stops at the optimum. 

For two users i,j e M, if 77* > 77*, the optimal weight 
factors 

n n nl-n+i ± (0) 
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follows. 

From (18) we have 
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Therefore, 77*,^ > 77*, holds if i < j. Further, since 77^ = 



^f 1 < V*uy yi e M > the initial state ^| 0) ^ »?i • V * e In 



each iteration step, if the optimum 77* is not achieved, 7/{* +1 ^ 
can always be increased so that tj| 
the convergence of the algorithm is proven 



(t+i) > v W t vi e M. Hence, 



B. Delay-optimal scheduling for dynamic channels 

It is worth noting that if channel state h varies over time 
and the base station has the knowledge of each channel state 
in advance, the algorithm in previous subsection can also 
be used in this case with some modification. However, in 
reality the base station has usually only the current channel 
state information and the statistical knowledge of the channel. 
Further the packet arrival process is non-ergodic and cannot 
be predicted. In order to avoid the possible infinite queueing 
delay, the delay-optimal policy must also be throughput- 
optimal, so that the queue state is kept stable for any expected 
arrival rate p inside the ergodic capacity region. 

If no new packet arrives after the time slot n = 0, the 
expected delay for a given policy V is 

( N M \ 



* EE*? 



JV 



= E E 



I M 1 
\l=\ 



M 

^ (N - n) r ' 



Vn 



a. 



(19) 



where rf n is the rate allocated by the policy V for the i-th 
user at n-th time slot. From Theorem 1 we know that if V is 
a throughput-optimal policy, then 
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where pF is independent of the current channel state. Hence 
the optimization problem is equivalent to 
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Then the optimization problem can be solved using algo- 
rithm 2. 

In the system with new packet arrival, the weight vector p} 
should be recalculated according to the new queue state and 
the rate allocation is determined with p} and current channel 
state h. 

Theorem 3: The proposed scheduling policy keeps the 
queue lengths finite for all arrival processes with expected 
arrival rates inside the capacity region. 

Sketch of the proof: It is easy to show that pj monotonically 
increases from to infinity if tft grows from to infinity, 
for all i e M. Therefore, the expected transmit rate vector 
E{r*} converges to a boundary point f where £t = , 
6 A4. Since p lies inside the ergodic region, we have 
r% > Pi, Vi G M. and the theorem follows. 



Algorithm 2 Delay Optimal Scheduling 
for each time slot n do 

1 Calculate the pervious average arrival rate a 

2 Calculate rj* according to a and current queue state 
q using algorithm 1, where the static channel region 
C(h, P) is replaced with the ergodic capacity region 

Cerg{P)- 

3 Calculate the current weight vector p} according to 
equation (16). 

4 Calculate the current rate allocation 

r* = arg max {ji 1 ) T • r, (22) 

rSC(h",p) 

where h™ is the current channel state, 
end for 



V. Numerical Evaluations 

We compare our scheduler with the LQHPR scheduler in [ 1 ] 
and the QPS in [4] in a two-user scenario. The OFDM system 
has 250 subcarriers and an entire bandwidth of 2.5MHz. The 
multipath channel has i.i.d block fading model and the length 
of fading block T was assumed to be 0.1ms. For an average 
transmit SNR of 15dB the ergodic capacity region was shown 
in Fig.l. Having chosen p x = [1.5,2,2.5,3,3.5,4,4] Mbits/s 
and p2 = [3,4,5,6,7,8,10] Mbits/s the average bit delay is 
shown in Fig. 2. It can be seen that for the arrival rate outside 
the capacity region, the queueing delay becomes extremely 
long. For the arrival rate inside the capacity region, the 
introduced scheduling policy has superior delay performance 
compared to the other two scheduling policies. 
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Fig. 2. Average bit delay 



VI. Conclusion 

We have provided a throughput and delay optimal schedul- 
ing policy for OFDM BC channels. Simulation results show 
that the average bit delay is significantly reduced with the 
introduced scheduling policies. 

Since the weighted sum rate maximization problem can 
be solved for any systems with convex capacity regions, i.e., 
MIMO and OFDM uplink/downlink channel, most of results 
presented in this paper can also be applied to these systems. 
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Fig. 1. Ergodic capacity region for 2 users 



